-
Notifications
You must be signed in to change notification settings - Fork 2.4k
feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 529 overloaded status code #3026
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 529 overloaded status code #3026
Conversation
michaelneale
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good - but not 100% sure about removal of models
I cleaned them up because they're not available on GCP VertexAI model garden anymore. Anyone that still has access to them can still type it in. but let me add them back. cleanup is not a blocker for me. |
michaelneale
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good - not able to verify myself with vertex but looks good
|
will need to update to main branch |
6da2346 to
5f0bbd7
Compare
Updated this morning and now again - looks like it waits for approval on a couple of jobs. Hopefully no conflict when you see this next time 🤞 |
5f0bbd7 to
5d5cbb6
Compare
|
@uddhav just updated this - looking ok? |
@michaelneale thanks - LGTM |
|
nice one! thanks |
* main: feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 529 overloaded status code (#3026) Fix a few ui edge cases - refresh occasionally crashing, chat loader over text and chat input height returning to auto (#3469) Don't default to main for build-cli (#3467) docs: add MongoDB MCP server tutorial (#2660) feat: run sub recipe multiple times in parallel (Experimental feature) (#3274) chore(release): release version 1.1.0 (#3465) chore: implement streaming for anthropic.rs firstparty provider (#3419) Fix regression: add back detail to tool-call banners (#3231) Document release process and update some just recipes (#3460) feat: add download_cli.ps1 file for windows (#3354) fix: session_file is optional (#3462) Bump more space for goose is working on it so it doesnt overlap incoming agent chat messages (#3453) Align chat input action buttons to bottom when large amount of text (#3455) docs: add Cloudflare MCP Server tutorial (#3278) feat(cli): Clear persisted session file with /clear command (#3145)
…9 overloaded status code (block#3026) Co-authored-by: Michael Neale <[email protected]> Signed-off-by: Soroosh <[email protected]>
…9 overloaded status code (block#3026) Co-authored-by: Michael Neale <[email protected]> Signed-off-by: Kyle Santiago <[email protected]>
…9 overloaded status code (block#3026) Co-authored-by: Michael Neale <[email protected]>
…9 overloaded status code (block#3026) Co-authored-by: Michael Neale <[email protected]> Signed-off-by: Adam Tarantino <[email protected]>
Refresh of #2280
GCP Vertex AI: Add retry handling for Anthropic API 529 overloaded status code
Adds support for handling Anthropic's HTTP 529 "API Overloaded" status code in the GCP Vertex AI provider. This status code indicates temporary backend capacity issues rather than quota exhaustion.
Changes
This improves reliability when interacting with Anthropic models through the Vertex AI provider during high-traffic periods.